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(57) Abstract: The present invention is directed to 
methods for making and using informative nucleic 
acid arrays (e.g.. DNA including cDNA, RNA, 
PNA) for research and other applications in various 
disciplines or areas of interest. Examples of such 
disciplines include, without limitation, dermatology, 
pharmacology, toxicology, oncology, gynecology, 
urology, gastroenterology, as well as studies of sentinel 
gene disco very» signature gene discovery, mechanism 
of action, drug screening, drug metabolism, etc. The 
informative nucleic acid arrays of the present invention 
may contain only the gene sequences that are of interest 
in a particular area of interest or apphcation and may 
exclude other gene sequences. 
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AND METHODS FOR MAKFXG SAME 



BACKGROUND OF T[^E INA'ENTION 
1 . Field of the ln\ ention 

The present in\-cnlion relates generally lo nucleic acid arrays and to 
methods for designing' and using nucleic acid arrass. In particular, the present 
nivention relates to nitonnativc nucleic acid arrays and methods for making the 
same, w 'nh an emphasis on the selection criteria for the individual gene sequences to 
be included in informari\ e nucleic acid aiTays. 
) 2. Description of the Related Art 

In general, most of the cells of an organism contain the same gene 
sequences. In eucaryotic organisms (i.e., those having a nucleus) the number of 
indi\'idual genes t\pically is in the range of tens of thousands of genes. All of these 
genes, howe\'er, are not used or expressed by all cells all of the time. Some genes 
are acti\ated, or expressed, only at a specific time, at a specific level, at a specific 
developmental stage, and/or in a specific cellular, physiologic, and/or tissue context. 
Determining when a gene is expressed, and what causes the gene to be expressed, 
may be key in better understanding the effects of various agents on cellular 
responses (e.g., potential phamiaccuticals, toxins, chemicals, temperature, pressure, 
or electromagnetic radiation, etc.). In addition, determining when a gene is 
expressed may yield a better understanding of the effects of various nomial or 
variant genes on disease pathogenesis (e.g., induced or hereditary disorders, etc.). 

To this end, a variety of molecular biology methods for classifying, 
indexing, and or quantirVing nucleic acid with regard to gene expression have been 
proposed. For example. U.S. Patent No. 5,707,807 relates to a method for 
classifying cDNA thai has been reverse-transcribed from tissue- or cell-derived 
RNA, The method is also applicable to the search and isolation of genes of 
physiologically active substances that are potential pharmaceuticals or causative 
genes of hereditary diseases, as well as the isolation of those genes that are useful 
for impro\-ing agricultural products. 
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Another example of gene expression analysis is in U.S, Patent N'o. 
5,96S,7S4. This patent relates to a method for taggin;^ and identifying all of the 
expressed genes ni a given cell population. By comparing gene expression protlles 
among cells, the method may be used to identify indi\'idual genes whose expression 
IS associated with a pathological phenotype. U.S. Patent No. 5,968,784 also relates 
to methods for identif\'ing gene expression palicms in an mRNA population to 
identify differential gene expression patterns among two or more cells or tissues. 

.Another useful contemporary mcthodolog\' for analyzing 
simultaneously a plurality of genes for gene expression levels utilizes nucleic acid 
airays (or microarrays or macroarrays, hereinafter collectivelyjcfcrred to as arrays). 
DNA arrays typically consist of hundreds to thousands of immobilized DNA 
sequences present on a surface of an object the size of a business card or smaller. 
The nucleic acids for the selected individual gene sequences are immobilized on the 
surfaces of nylon filters, glass, plastic, or gene chips, etc. Robotic technologies may 
be employed in the production of arrays. Labeled probe samples are prepared from 
R>.'A from biological samples. The probes arc hybridized to the immobilized 
nucleic acids on the arrays, and a detector instrument collects the intensities of 
hybridization of the bound labeled probe sample to the individual gene sequences. 
Then, computer software t>pically analyses the results. For example, U.S. Patent 
N'o. 5,807,522 relates to a method and apparatus for forming arrays of biological 
samples on a support in an automated fashion. U.S. Patent No. 5.922,617 relates to 
a product having arrays of samples in tracks, wherein light emitting labels are 
excited and emitted light is detected. For example, the arrays may be located in 
circular tracks on a compact disc-like support. In addition, U.S. Patent No. 
5.922,61 7 relates to a reader for determining the occurrence of events on the array. 

With regard to the methodologies for selecting individual gene 
sequences for inclusion on arrays, currently available gene expression arrays 
tvpically consist of one of two types of gene selection methods. In one case, gene 
sequences that are merely available to the designer are immobilized on an array, 
regardless of any biological significance of the selected gene sequences or as the 
result of experimentation. In the other case, gene sequences that are expressed in 
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one coll i>pe or tissue i;.pe are chosen for inclusion on the array. Both of these 
conventional gene selection methodologies produce general utility arra\*s that have 
limited ^"infomiatiN e" capabilities. This disadvantage is especially pronounced if the 
general arrays contain a relatively small number of immobilized gene sequences, 
tlius reducing the likelihood that one will discover differentially expressed gene 
sequences durmg experimcnration. Although arrays are becoming more widely 
knou'n and used. \er> little attention has been paid to the creation or use of 
infomiaii\*e nucleic acid arrays. 
SL^MMAR^' OF THE IW'ENTION 

Thus, a need has arisen for informative nucleic acid arrays, and for 
methods for selecting the individual gene sequences for inclusion on the informative 
arrays, and for making the same. 

As embodied and broadly described herein, the present invention is 
directed to methods for making and using informative nucleic acid arrays (e.g., DNA 
including cDNA, RNA, PNA) for research and other applications in various 
disciph'nes or areas of interest. Examples of such disciplines include, without 
limitation, deniiatology, pharmacology, toxicology, oncology, gynecology, urology, 
gastroenterology, as well as studies of sentinel gene discovery, signature gene 
discovers', mechanism of action, drug screening, drug metabolism, etc. The 
intbrmative nucleic acid arrays of the present invention may contain only the gene 
sequences that are of interest in a particular area of interest or application, and may 
exclude other gene sequences. 

According to one embodiment of the present invention, a method for 
identifying genes that are differentially expressed between dissimilar biological 
samples for use in an infon-natiN-e array is disclosed. The method includes the steps 
of (1) providmg a first set of heterogeneous nucleic acid probes derived from a first 
biological sample; (2) providing a second set of heterogeneous nucleic acid probes 
derived from a second biological sample wherein the first and second biological 
samples are different and have a common biological process; (3) hybridizing a 
nucleic acid array compnsing a plurality of sequences derived from genes of the 
biological process with the first set of probes and determining a first level of 
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expression for sequences of the array: (4) hybridizing the array with the second set 
of probes and deiennming a second level of expression for sequences of the array; 
and (5) identifying a plurality of genes that are differentially expressed in the 
biological process b>' comparing the first level of expression with the second le\ el of 
expression for hybndized sequences. The biological process may include the 
processes of adsorption, distribution, metabolism, and excretion of drugs, toxins and 
chemicals, and the biological process may affect the behavior of endogenous or 
exogenous chemicals in cells. The biological process may also be related to 
dermatolog\-, pharmacology, toxicology, pathology, oncology, gastroenterology, 
urology, or g\Ticcolog> . 

In one embodiment, the first and second biological samples may 
originate from skin, skin appendages, oral tissue, gastrointestinal tissue, neural 
tissue, renal tissue, hepatic tissue, and/or urogenital tissue. 

in one embodiment, a database may be created that includes the 
differentially expressed genes identified by the method above. In another 
embodiment, an informative array of sequences on a solid support may be created 
including sequences that are derived from the differentially expressed genes 
identified by the method above. 

In another embodiment of the present invention, a method for 
selecting genes for an informatixe nucleic acid array for a biological process is 
provided. The method includes the steps of (1) identifying genes that are 
differentially expressed in the biological process; (2) establishing a ranking of the 
differentially expressed genes, wherein genes having a moderate level of expression 
are ranked o\'er genes having a lower level of expression and genes having a higher 
level of expression; and (3) placing sequences derived from ranked differentially 
expressed genes on the informative array. The genes may be expressed in the 
biological process. 

In one embodiment, the step of identifying may include the steps of 

(1) providing a first sei of nucleic acid probes derived from a first biological sample; 

(2) providing a second set of nucleic acid probes derived from a second biological 
sample, wherein the second biological sample is dissimilar from the first biological 



sample; (3) h\bridi/.ing ::ie firsi set of probes to llie niicroamiy and dclcrmininu a 
firsi level oi expression for hybridized uenes; (4) hybridizing the second se: of 
probes to the nucroarray and delcn-nining a second le\ei of expression for 
hybridized genes; and (5) identifying genes that are diffcrentialK' expressed b\' 
comparing the first ie^ ci of expression with the second level of expression for 
hybridized genes. 

According lo another embodiment of the present invention, a method 
for converting a nucleic acid array inio an infomiaiive array is disclosed. The 
method includes the steps of (1) providing a first set of heterogeneous nucleic acid 
probes derived from a first biological sample; (2) providmg a different, second set of 
heterogeneous nucleic acid probes denved from a second biological sample; (3) 
hybridizing a nucleic acid array comprising a plurality of sequences with the first set 
of probes and detemiinmg a first level of expression for sequences of the array; (4j 
hybridizmg the array with the second set of probes and determining a second le\ el 
of expression for sequences of the array; (5) identifying a plurality of genes that are 
differentially expressed in the biological process by comparing the first level of 
expression with the second level of expression for hybridized sequences; and (6) 
selecting moderately expressed genes from the plurality of identified differentially 
expressed genes for inclusion on the informative array. 

According to yet another embodiment of the present invention, an 
informati\e nucleic acid arra\' for gene expression analysis is disclosed. The 
informative nucleic acid array includes sequences derived from genes of cells that 
are substantially relevant to a biological process; sequences derived from a plurality 
of differentially expressed genes that are relevant lo the biological process as 
identified by the method for identifying genes that are differentially expressed 
between dissimilar biological samples for use in an informative array discussed 
above; and a platfonn for the sequences which is selected from the group consisting 
of a filter surface, a glass surface, a plastic surface, a solid bead surface, and a gene 
chip surface. 

[t is a technical advantage of the present invention for informariv'e 
arrays to contain immobilized gene sequences that have been carefully selected from 
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a larycr set of candidate genes. It is another technical advantage of the present 
nnention for infomiatixe arrays to increase the hkelihood that the gene sequences 
inmiobihzed on it will be more informative (e.g., differentially expressed) in a 
desired application, relative to a general array lacking a similar level of informative 
5 potential. It is another technical advantage of the present invention for infom^ative 
arravs to increase the likelihood of identifying biomarkers, consisting of 
differentiall}' expressed genes. Furthemiore, it is another technical advantage of the 
present mvention for informati\'e arrays to permit reduction in the total number of 
gene sequences immobilized on the informative array. A reduction in the number of 
10 gene sequences may result in a reduction in the size of the informative array, due to 
the exclusion of non-informative genes from the list of candidate genes during the 
gene selection process. 

BRIEF DESCRIPTION OF THE DRAWINGS 

For a more complete understanding of the present invention, the 
15 objects and advantages thereof, reference is now made to the following descriptions 
taken in connection with the accompanying drawings in which: 

Fig, 1 depicts a flowchart representing a possible method for 
producing an informative nucleic acid array according to one embodiment of the 
present invention; 

20 Fig. 2 depicts a plot of an experimental result using an informative 

array according to one embodiment of the present invention: and 

Fig. 3 depicts another plot of an experimental result using an 
infonnative nucleic acid array according to one embodiment of the present 
invention. 

25 DETAILED DESCRIPTION OF THE INVENTION 

The preferred embodiment of the present invention and its advantages 
are best understood by referring to Figs. 1-3 of the drawings. 

It should be noted that, although the present invention is described 
with regard to human or mammalian genes, it is not so limited. The present 
30 in\-ention contemplates applications with a wide variety of unicellular and multi- 



cellular organisms, including animals, plants, protisis. fungi, etc., both living and 
dead (e.g., cr>ogentcall> -preserv ed materials from formerly living materials). 

Referring to Fig. L a novvchart depicting a method for producing 
ii^formativ'e nucleic acid arra\'S f'smai*! arrays") according lo one embodiment of the 
present in\-ention is provided. The array may include isolated or purified 
immobilized nucleic acids, cither native or synthetically created sequences, such as, 
for example, oligonucleotides, fragments, partial and fuU-iength cDNAs, expressed 
sequence tags (ESTsj, mcluding both partial and full-lengih ESTs. As used herein, 
the phrase "nucleic acid" means any scries of nucleotide sequences connected via a 
chemical backbone such as polydeoxyribose, polyribose. or polyamidc, and is 
preferably DNA, RNA. or PN,A. The figure schematically outlines a possible 
pathway for the creation and use of an informative gene list and nucleic acid array, 
involving 5 general steps, identified as I-V. Step I involves preparation of the 
biological materials. Step II invoh es the use of a nucleic acid gene expression array 
to identify differentially expressed genes. Step III involves analysis of the data sets 
generated from the nucleic acid array experiments. Step IV involves selection of the 
most informative and relevant genes for the desired application, and the subsequent 
creation of the informative nucleic acid array. And, Step V involves use of the 
informative nucleic acid array to identify biomarkers. Fig. 1 serv^es as an example, 
and is not intended to be restrictiv e in scope, and the invention is not so limited. 

The tenns "genes" and '^gene sequences." as used tliroughout this 
document, includes full-length sequences of genes as well as partial gene sequences 
and fragments of gene sequences. A gene is any sequence of nucleotides that either 
detects or encodes expression of a protein or a protein fragment. 

In Step I, biological samples are provided. Examples of preferred 
biological samples include samples of skin, skin appendages, oral tissue, 
gastrointestinal tissue, neural tissue, renal tissue, hepatic tissue, urogenital tissue, 
cells, cell cultures, cell lines, tissues, organs, organelles, biological fluids, and/or 
whole organisms. 

In one embodnnent, the biological samples may be different, or 
dissimilar, samples. In another embodiment, the biological samples may have a 
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common biological process, such as adsoq^tion, disiribution, metabolism, and 
^\\cretioii oi' dru^s, loxins and chemicals. The biological process may affec: the 
beha\ ior of endogenous or exogenous chemicals in cells. 

In another embodiment, the biological process may be substamially 
related to dermatology, pharmacology, toxicology, pathology, oncology, 
gastroenterolog\', urology, or gynecology. Although these studies are identified, the 
present invention is not so limited. 

In yet another embodiment, three or more biological samples are 

preferred. 

In Steps II & in, expressed gene sequences may be identified. In one 
embodiment, this may include screening at least one nucleic acid array using at least 
one probe to identify expressed gene sequences. In one embodiment, chemically- 
labeled or radiolabeled (or tagged) probes that may be derived from RNA samples of 
biological samples may be used to identify expressed gene sequences or cross- 
hybridizing members of a gene family. 

In Step II, a set of heterogeneous nucleic acid probes are derived 
from each biological sample provided in Step 1. Next, a nucleic acid array is 
hybridized with each set of probes, and a level of expression for sequences of the 
array is determined and recorded. This may be accomplished with any number of 
commercially a\ailable detector device, such as, for example, a phosphor imager, 
autoradiographic film, or a fluorescence microscope, all of which are commercially 
available. 

In one embodiment, it is preferred to perform three or more 

experiments. 

In Step III, data analysis may be employed to identify informative 
differentially expressed genes. In one embodiment, differences between the 
measured levels of expression for each set of probes is determined. This may be 
achieved through any suitable method, including determining a hybridization 
intensity difference between the first level of expression and the second level of 
expression, establishing a ratio between the first le\-el of expression and the second 
level of expression, establishing a ratio threshold for the first level of expression and 
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[he second le\cl of expression, or determining a percent standard deviation for tiie 
first le\'el of expression and the second level of expression. It is also possible to use 
an algorithm to quantify differences between the levels of expression, to perform 
cluster analysis on the ievels of expression, or to perfomi dendrogram analysis on 
5 the levels of expression. 

As discussed above, it may be advantageous and/or preferred to 
include a minimum of three or more dissimilar biological samples. In this example, 
an investigator attempts to identif\' examples of gene sequences wherein an 
individual gene is substantial!}' over- or under-expressed in one sample but not in 
10 another. By companng three or more samples, one may more readily characterize 
tb.e specificity, selectivit\\ and Innitation of the particular differential expression 
response in a plurality of biological samples or experimental contexts. Conversely, 
these analysis methods may also be used to detect individual gene sequences whose 
expression patterns are similar when comparing dissimilar probe samples, which is 

1 5 most often the case. 

The identified differentially expressed genes may be ranked. In 
general, the ranking may be determined by comparing the standardized (or 
normalized) mtensities of an individual gene compared between two or more 
dissimilar biological samples. The standardized or normalized value for the 

20 expression of any panicular gene is arbitrary as it is simply used for comparati\*e 
purposes. Thus, the value obtamed froin any first set of probes can be used as the 
standard to which subsequent sets are compared. 

In one embodiment, it may be preferable to establish a ratio between 
the levels of expression may be used to rank the genes. For example, the larger level 

25 of expression may be compared to the smaller level of expression, yielding a ratio of 
expression. It may be preferred to rank genes having a ratio funher from 1.0 over 
genes having a ratio closer to 1 .0. and more preferable to rank genes having a ratio 
of less than O.S and greater than 1.2 over other genes. For instance, an individual 
gene might be represented as 1.6 x over-expressed in one sample relative to the 

3'.^ other, based on the experimental results. Other multipliers, such as l.S, 2.2, and 3.0 
may be used. .Arbitrarv' or expenmentally determined ratio thresholds may be used 
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to cstablisli cnieria for inclusion and exclusion of individual candidate gene 
sequences in ihc subsequeni ereauon of an informative nucleic acid arra\'. For 
mstance. one ma>' chose onl\* those gene sequences that are over-expressed or under- 
expressed by 1.6 < (e.g.. cither 1 60 percent or 62.5 percent of the reference sample), 
l.S, 2.2.. 3.0, etc. may be used. These numbers, however are not intended to be 
restrictive. 

In another embodiment, the ranking may be determined based on the 
difference in the le\'els of expression. For example, in one embodiment, genes 
having a higher difference arc ranked over genes having a lower difference. Before 
ranking, an absolute value of the difference may be taken such that the absolute 
value of the difference is used to rank the genes. 

In another embodiment, a standard deviation for the levels of 
expression may be used to rank the genes. For example; genes having a higher 
standard deviation may be ranked over genes having a lower standard deviation. 
This is especially meaningful when comparing three or more dissimilar biological 
samples. 

In one embodiment, it may be preferable to rank genes having a 
moderate level of expression over genes having a lower level of expression and 
genes having a higher level of expression. This may increase the likelihood of 
identifying genes with substantial differences in the levels of expression between 
dissimilar probe samples. When all of the intensities of the individual gene 
sequences immobilized on an array have been collected from a hybridization 
experiment by a detector instrument and analyzed, a broad spectrum of indi\'idual 
gene sequences' intensities are generated. 

Surprisingly, ii has been noted by the inventors of the present 
invention that the gene sequences on a nucleic acid array that are expressed at very 
high levels of intensities, seldom demonstrate differential expression between 
dissimilar biological samples (i.e.. are typically invariant). Thus, they are unlikely 
to provide informative responses if included in an array. This is counter-intuitive 
and opposite of what one of ordinary skill in the art would expect. 
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h has also been surprisingly noted by the inventors of tiie present 
inveiuior^. that the uene seuaences that are expressed ai very low le\-els (e.^., within 2 
of the intensit\' of the back ground for detection) on a nucleic acid array, 
demonstrated the greatest potential for intensity fluctuations (e.g., percent standard 
5 deviation.-). In other v.ords, the \ery low expression level gene sequences 
demonstrate considerable "noise*' in the signal vs. noise analyses. This may be due 
to the limitations of detection at or near the background intensity of probe 
hybridization. Thus, these nois\' gene sequences may pose a problem in the anaK'sis 
and deiermination of significance of differentially expressed gene sequences. 

10 Therefore, ranking and or selecting may be biased against genes with very low- 
hybridization intensities. 

In Step I\*. the identified gene sequences may be selected and applied 
to a platform. In one embodiment, the selected gene sequences may be applied to a 
nucleic acid array filler surface (e.g., nylon filter). In another embodiment, the 

15 identified gene sequences may be applied to a glass surface (e.g., a glass slide). In 
another embodiment, the identified gene sequences may be applied to a plastic 
surface. In another embodiment, the identified gene sequences may be applied to a 
gene chip. In another embodiment, a suitable platfonn is the compact disc-like 
structure disclosed in U.S. Patent No. 5,922,617. In another embodiment, the 

20 identified gene sequences may be applied to the surfaces of individual solid beads 
(e.g., resin beads or glass beads). Other platforms may be used, as desired. 

In Step 1\'.2, gene sequences that are expressed in or are relevant to 
the area of interest are identified. For example, in one embodiment, genes that are 
expressed as differentiation biomarkers, such as, for example, in skin tissue, or skin 

25 cell t\pes (e.g., keratinocyics, melanoc\tes, and fibroblasts), or other epiihelia are 
identified. Similarly, in another embodiment, genes that are implicated in 
piiarmacologN'. toxicology, drug metabolism studies, etc. may be identified. The 
genes expressed in or reic\-ant to the area of interest may be selected for inclusion. 

Scientific literature may be consulted in order to identify these genes. 
In another embodiment, electronic databases may be consulted. For example, 
Enirez, GenBank, SwissProt, or other databases may be consulted. In yet another 
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embodiment, other sourecs may be used to identity genes (e.g., commercial! v 
available sources). 

An example of commercially a\'ailablc nucleic acid arrays thai may 
be created by the process discussed abo\e include Genefilrers" arrays, available 
from Research Genetics. Inc. of Huntsville, Alabama. For example, the nucleic acid 
arrays available include, inter alia , randomly chosen gene sequences, previously 
identified named gene sequences, prostate-specific gene sequences, ovary-specific 
gene sequences, colon-specific gene sequences, with each collection of gene 
sequences available on a single nucleic acid array product. An example of another 
commercial source of nucleic acid arrays is Affymetrix, Inc. (Palo Alto, California). 

In one embodiment, multiple individual gene sequences, or multiple 
sets of gene sequences, may be duplicated on the nucleic acid array for statistical 
analysis. By including some or all of the selected gene sequences in multiples 
(especially consisting of a minimum or three each), it becomes feasible to perform 
inter- and intra-illter statistical analyses (e.g., standard deviations) of hybridization 
intensities. It may be preferable to include a limited subset (e.g., preferably 10% or 
less) of the gene sequences on the filter in multiples (e.g., triplicates) and the 
remainder of the gene sequences in singles, so as to maximize the total number of 
different gene sequences on the nucleic acid array. This feature may be 
advantageous for the design of the informative nucleic acid array. 

In addition, multiple applications for the informative nucleic acid 
arrays of the present invention are provided. Nucleic acid arrays may be constructed 
on any appropriate immobilized material surface, such as, for example, glass, 
plastics, nylon filters, nitrocellulose filters, silicoa''metal polymers, plastics, etc. by 
robotic spotting technology or any other suitable means. The arrays may contain 
gridded nucleic acid samples (or other natural or artificial nucleic acid or PNA 
samples) of partial or full-length gene sequences that are expressed in a species of 
organism. In one embodiment, the nucleic acid arrays may be subjected to nucleic 
acid hybridization probes that are appropriately labeled (e.g., radiolabelled or 
iluorcscently-tagged) and generated from RNA that is derived from cells (e.g., 
keratinocytes), whole or partial tissues (e.g., skin, gastrointestinal tract, reproductive 
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tract, etc.). u iiolc or pania! organs, and whole or partial organisms. Probe sets ma\ 
reprcscni the entire number of probes obtained Ironi the biological sample, or ma> 
be preselected to represent a subset of tliai entire number. 

Numerous internal controls, both positive and negati\e. may be 
included on the nucleic acid array surface. The immobilized nucleic acid samples of 
specific gene sequences may be generated synthetically as oligonucleotides, PCR- 
amplitlcation products, plasmids. or from other sources. 

The hybridization e.Kperiments may include testing immobilized 
human gene sequences with human-derived probes, or immobilized rodent gene 
sequences \%ith rodent-derived probes, or homologous probing e.xpenments in other 
species, or heterologous probing experiments (e.g., immobilized human gene 
sequences with non-human primate-derived probes). The experiments may 
routinely compare a set of related probe samples from tissues or cells exposed to 
different treatments, or normal vs. diseased samples. 

The nucleic acid arrays may be subjected to multiple probing 
experiments for serial comparisons of the hybridization strengths of different 
sources of RXA (e.g., from two different tissues). 

Once the informative nucleic acid arrays are created, gene expression 
experiments may be performed. In one embodiment, computer software may be 
used to facilitate quantitation of differences in gene expression pattems betw'een 
radiolabelled experimental probe samples (cDNA from mRNA) on the nucleic acid 
arrays. As a result, gene expression experiments that took months may be 
performed significantly faster. In one embodiment, gene experiments may be 
performed in one day or less. 

The softu-are may perform data analysis to detect differential gene 
expression, hi one embodiment, statistical analyses may be provided to assess 
confidence levels for data analysis. Multiple array t\pes and multiple experiments 
may be combined into a single project for rapid, comprehensive data analysis. 

In one embodiment, Pathvvays'^"-series software, available from 
Research Genetics, Inc. of Huntsville. Alabama, may be used to perform this gene 
expression analysis. 
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In one embodiment, the infonnativ e nucleic acid arra_\s of the present 
KU'enijon ma>' be used to perform gene expression analyses on RNAs isolated from 
cultured cells, ussues. organs, or organisms. In another embodiment, informative 
nucleic acid arrav's of the present invention may be used as a diagnostic device for 
molecular pathology, molecular toxicology, molecular pharmacolog>'. etc. It may be 
used to identify signature biomarker genes and/or signature sentinel biomarker 
genes. .A. signature biomarker gene is a gene which is indicative of a cell, cell type 
or biological process and is constituitively expressed such as, for example, PSA 
gene as indicative of prostatic epithelium. A sentinel biomarker gene is a gene 
which is also indicative of a cell, cell tvpc or biological process, but is transiently 
expressed in response to a treatment such as, for example, the cvtochrome p450 
gene whose expression is effected by drug exposure. 

As discussed above, nucleic acid arrays according to the invention 
may be subjected to multiple probing experiments for serial comparisons of the 
hybridization strengths of different sources of RNA (e.g., from two different 
tissues). The general utility of nucleic acid array technology outside of the field of 
dermatology has been shown. The present invention recognizes that the application 
of nucleic acid array technology is of specific relevance and utility in dermatologic 
research, as evidenced by the following general examples. These examples are 
offered to illustrate embodiments of the invention, and should not be viewed as 
limiting the scope of the invention. 

EXAMPLES 

In order to better understand the present invention, several examples 
are provided. These examples do not limit the present invention in any way, and are 
intended to illustrate embodiments and potential applications of the present 
invention. 

Example 1: Dru^J Discoverv (//? vitro and in vivo) 

Potential new active ingredients (either chemical or biological) were 
subjected to cultured cells or to tissues (although it would require removal of a tissue 
sample). The agents were compared using the informative nucleic acid array to an 
active standard. This approach provides empirical profiles that are useful in 
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deicmiinini: the mechanism of acuon of a prcsiously known or new acii\'e 
ingredient. This inrbmiation ma> pro\-idc msiuhis into the , identirlcation of 
appropriate molecular targets for dru^' discovery. 

For a more specific example, referring to Fig. 3/ the hair growth 
5 stimulant, mmoxidil (sold commercially as Rogaine"), was applied to cultured 
normal human keratinocytes, to determine which genes are differentially expressed 
in response. Those uhich are substantially up- or down-regulated may ser\'e as 
sentinel biomarkcrs for minoxidifs action, and the same gene sequences may be 
used in subsequent experiments to find other compounds that produce a similar 
10 sentinel biomarker response. Furthermore, the gene sequences may serv'e as general 
indicators of proliferation and/or differentiation and/or toxicity status. Other 
reference drugs may be selected for similar experiments, such as retinoic acid, non- 
steroidal anti-intlammatories, etc. 

Example 2: Safety & Toxicology Testing of Products Im v/Yro and in vivo) 
15 Chemical or biological agents were subjected to cultured cells or to 

tissues (although it would require removal of a tissue sample). The agents were 
compared to an active or toxic sample. This approach provides empirical profiles 
that may be useful in determining the mechanism of action or toxicity of a new 
active ingredient. Once the reference profiles are validated, use of this nucleic acid 
20 airay methodology may reduce the need for animal and/or human in vivo clinical 
testing. 

For a more specific example, the toxic compound, hydroquinone, was 
applied to cultured melanocytic cells (e.g., a melanoma cell line), to determine 
which gene sequences are differentially expressed in response to the toxin. Those 

25 which are substantialK' up- or down-regulated may ser\'-e as sentinel biomarkers for 
hydroquinone's action, and the same gene sequences may be used in subsequent 
experiments to find other compounds that produce a similar sentinel biomarker 
response. Furthermore, the gene sequences may serve as general indicators of 
proliferation andvor differentiation and/or toxicity status. Other toxic compounds 

?{j may be used in similar experiments, 

Example 3: Comparison of Normal \ s. Diseased Tissues or Cells 
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The gene expression profiles of nonnal vs. diseased tissue or cells 
nia\ indicate the causes and or consequences of a genetic and/or induced pathologic 
disorder. For instance, the gene expression profiles of keratinocyte or lymphocyte 
cells from psoriasis patients may be compared to normal ceils from normal patients. 
5 A plurality of potential pathologic disorders could be examined, such as cancer, 
inflammation, diabetes, and other disorders. Furthermore, similar experiments may 
be conducted to determine the differences between dissimilar cell types within a 
single tissue, such as keratinocylcs, melanocytes, and fibroblasts within skin, for 
msiancc, as depicted in Fig. 2, 
1 0 Example 4: Effects of Environmental Exposure 

The comparison of treated versus untreated biological samples or 
cultured cells may reveal the effects of environmental exposure to radiation or other 
conditions, such as pressure or shear forces. For instance, exposure of biological 
samples to ultraviolet, infrared (or temperature), X-rays, or other forms of radiation 

1 5 may reveal the consequences of exposure and potential damage. 
Examples: Signature or Sentinel Biomarker Databases 

The advances in the Human Genome Project (and with other species) 
have opened doors of opportunity to catalog for the first time on a very large scale 
large numbers (e.g., thousands) of the expressed gene sequences for individual cell 

20 t\pes, tissue tvpes, organs, and for individual pathological disorders. The large data 
sets obtained using informative nucleic acid arrays may establish valuable reference 
databases of signature and/or sentinel biomarker gene sequences for use in various 
other cxpenments. The raw and subsequently analyzed data sets can include various 
features, such as individual gene hybridization intensities and/or differential 

25 expression ratios between biological samples, among other possibilities. 
Furthermore, the gene lists, per se. used in designing the informative nucleic acid 
arrays may also serve as valuable biomarker databases. 
Example 6: Designing Diagnostic Devices 

The information gathered in experiments using informative nucleic 

30 acid arrays may be used to select one or more gene sequences for inclusion in the 
de\'elopment and design of medical diagnostic devices for pathological analyses. 
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The devices may contain one or more yene sequences in a product resembling an 
infomiaiive nucleic acid arra;/. or ma\' be more limited in scope (e.g.. containing 
perhaps -lOf* gene sequences instead of thousands of gene sequences). The de\'ices 
may also detect protein products, that are the results of gene expression from the 
5 gene sequences identirled usmg the informative nucleic acid arrays. The use of the 
inromiati\e nucleic acid arrays (or databases, above) helps in ranking, priontizing, 
and selecting potential diagnostic biomarkcrs from a large plurality of candidate 
gene sequences. 

Example 7: Potential Gene sequences 

10 As an example a list of poiential gene sequences for inclusion in a 

dermatologic nucleic acid array follows. Some or all of these gene sequences (or 
ponions thereoO niay be immobilized in nucleic acid arrays, and might be additions 
to nucleic acid array products, such as GeneFilters^ prepared by Research Genetics, 
Inc. An example incorporating some or all of the gene sequences within this list is 

15 the DenmArray^^^ nucleic acid array produced by IntegriDerm, Inc. of Huntsville, 
Alabama, and the inventors of the present invention. 



Examples of Gene sequences of Relevance for Inclusion in a Dermatologic Array 



f' Gene Catesorv 


Gene sequences 


1 KeratinocMo-specific 

i 


Keratin 5 & 14 (basal); Keratin 1 & 10 (suprabasal); Keratin 6 & iO 
(h\perprolifcrative): Hair keratins; SI 00 gene family (e.g. calpactin 1 
light chain, psoriasin); Involucrin; Loricrin: Profillagrin; Envoplakin; 
Desmin: Cadherins; Transglutaminase; Integrins; Protein Kinase C 
eta; Rctinoic acid receptors; Proline-rich repeats 


i; Meianocyte-spccific 


alpha-Melanocyte stimulating hormone; Agouti signal protein; 
Melanocortin receptor; Adenylate cyclase; Microopthalmia 
transcription factor: Tyrosinase; Tyrosinase-related protein 2 
(DOPAchrome Tautomerase); Tyrosinase-related protein 1 (DHIC.A. 
Oxidase): Catalase; Phenylalanine hydroxylase 


Fibroblast-specific and 
extracellular matrix- 
specific 


Collagens; Collagenases; Laminin; Elastin; Elastase; alphal- 
Anti trypsin; Fibronectin: Matrix metalloproteinases; Tissue inhibitor 
of metalloproteinases 


1 Immunomodulatorv 

! 


Interleukins; Interferons; (prostaglandin synthesis); (leukotriene 

syiMhcsis) 


Cell cycle and apoptosis | p53: plo; p21 
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Growth factors and 
sccondar>' messenger 
Signaling 


Mitogen-acti\'atcd protein kinase; Tvrosine kinases: G-proteins; 
Epidcnnal growtii factor: EGF receptor: B Fibroblast growth factor; 
FGF-5; FGF receptors; Estrogen receptor: PCNA; c-Ha-ras; c-nv/c; 
Ornithine Decarboxylase 


Transcription factors 


c-fos; c-jun; AP-1 


; Endothelium 


.Ajigiotcnsin-converting enzyme; VonWillebrand Factor; Endothelm; 
Endothelin receptor 


; Xenobiotic metaboHsm: 


Cytochrome P450 genes; Sulfotransferases; Glutathione 
sulfurtransferascs: UDP Glucuronosyltransfcrascs: Rhodanese; 
Polycyclic aromatic hydrocarbon receptor; Peroxisome proliferator 


I Other 


Nitric Oxide Svnthase; Dopamine receptors; Chaperonins (HSPs); 
Senescence markers; DNA repair (ERCG); PhotoKase; 
Metallothionine; Vlulti-drug resistance; MRP 


Controls 


Actin; Ribosomal protems: GAPDH; Mitochondrial proteins; 'lotal 
genomic DNA" 



Examples: Differentially Expressed Gene Plots 



Referring to Fig. 2 and 3, plots of differential expressed genes are 
provided. In Fig. 2, an example of differentially expressed genes detected using an 
informative array according to one embodiment of the present invention is provided. 
5 The plot of an informative nucleic acid array identifies hundreds of genes (shown as 
individual dots) that arc differentially expressed when comparing the standardized 
hybridization intensities of the radiolabeled probes from the two most abundant 
human skin-derived cell tvpes in culture (i.e., keratinoc\tes and fibroblasts). 
Differentially expressed genes diverge substantially from the diagonal (i.e., 1 X- 
10 axis). In this example, the differentially expressed genes represent signature 
biomarker genes for normal fibroblasts (upper left) and normal keratinocytes (lower 
right). 

Fig. 3 depicts a plot of sentinel biomarker genes of minoxidil-treated 
cultured normal human keratinocytes are plotted as a function of minoxidil 
15 concentration (log-log scale). Each line represents an individual gene's differential 
expression ratio identified in hybridization experiments using radiolabeled probes 
from each test concentration of minoxidil. Note that some sentinel genes are up- 
regulated and other sentinel biomarkers are down-regulated in a dose-dependent 
manner. 
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CLAIMS 

We claim; 

1- A method for identifying a pluralit\' of genes that arc differentialK- 
expressed between dissimilar biological samples for use in an informative arrav. 
comprising: 

pro\ iding a first set of heterogeneous nucleic acid probes derived from a first 
biological sample; 

pro\ iding a second set of heterogeneous nucleic acid probes derived from a 
second biological sample wherein the first and second biological samples arc 
different and lia\ e a common biological process; 

hybridizing a nucleic acid array comprising a plurality of sequences derived 
from genes of said biological process with the first set of probes and determining a 
first level of expression for sequences of the array; 

hybridizing the array with said second set of probes and determining a 
second level of expression for sequences of the array; and 

identifying a plurality of genes that are differentially expressed in said 
biological process by comparing the first level of expression with said second level 
of expression for hybridized sequences. 

2. The method of claim 1, wherein the biological process is selected 
from the group consisting of the processes of adsorption, distribution, metabolism, 
and excretion of drugs, toxins and chemicals. 

3. The method of claim 1, wherein the biological process affects the 
behavior of endogenous or exogenous chemicals in cells. 

4. The method of claim 1, wherein the biological process is 
substantially related to dermatology, pharmacology, toxicology, pathology, 
oncology, gastroenterology, urology, or gynecology. 

5. The method of claim 1, wherein the first and second biological 
samples are selected from the group consisting of samples of skin, skin appendages, 
oral tissue, gastrointestinal tissue, neural tissue, renal tissue, hepatic tissue, 
urogenital tissue and combinations thereof 
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Other embodiments and uses of the invention u ill be apparent to 
those skilled in the 2": trom consideration of the specification and practice of the 
invention disclosed herem. All references cited herein, includu^^g all U.S. and 
foreign patents and patent applications, arc specificall\ and entirely hereby 
incorporated herein by reference, including U.S. provisional application number 
50,176,022 and U.S. provisional application number 60/ 1^^7.991. U.S. Patent No. 
5,707,807, U.S. Patent No. 5.96S.784, U.S. Patent No. 5,807,522, and U.S. Patent 
No. 5,922,617. It is intended that the specification and examples be considered 
exemplary only, with the true scope and spirit of the invention indicated by the 
following claims. 



6. The n;e:hod of claim L wherein the first and second biological 
samples are selected from liic group consisting of samples ofccUs. cell cultures, cell 
hnes, tissues, organ-, organelles, biological fluids, whole organisms and 
combinations thereof. 

7. The meihod of claim 1, whcreui the differentially e.xpressed genes are 
derived from genes of epilhelial cells or tissues. 

S. The method of claim 1. wherein the differentially expressed genes are 
derived from genes of cells selected from the group consisting of skin, skin 
appendages, oral tissue, gastrointestinal tissue, neural tissue, renal tissue, hepatic 
tissue, urogenital tissue and combinations thereof. 

9. The method of claim I, wherein the step of identifying comprises: 
quantifying the difference in the first level of expression and the second level 

of expression for hybridi/xd sequences by determining a hybridization intensity 
difference between the first level of expression and the second level of expression, 
establishing a ratio between the first level of expression and the second level of 
expression, establishing a ratio threshold for the first level of expression and the 
second level of expression, determining a standard deviation for the first level of 
expression and the second level of expression, and determining a percent standard 
deviation for the first le\'el of expression and the second level of expression. 

10. The method of claim I, wherein the step of identifying comprises: 
quantifying the difference in the first level of expression and the second level 

of expression for hybridized sequences by applying an algoritlmi to the first level of 
expression and the second level of expression, performing cluster analysis on the 
first level of expression and the second level of expression, or performing 
dendrogram analysis on the first level of expression and the second level of 
expression. 

1 1. The method of claim 1, further comprising the step of hybridizing the 
array with three or more sets of probes, each derived from a different biological 
sample, and determining a level of expression with regard to hybridization to 
sequences of the array. 
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12. The method of claim 1, further comprising the step of establishing a 
ranking of the identihcd genes that are differentially expressed. 

13. The method of clami 12, w herein ihe step of establishing a ranking of 
the identified genes thiat are differential!}' expressed comprises: 

ranking the identified genes based on a difference between the first le\el of 
expression and the second lev el of expression for identified genes. 

14. The method of claim 12, further comprising the step of 
determining an absolute value for the difference between the first le\'el of 

expression and the second lev el of expression for the identified genes, and 

ranking genes havmg a higher difference over genes having a lower 
difference. 

15. The method of claim 12, wherein the step of establishing a ranking of 
the identified genes that arc differentially expressed comprises: 

ranking the identified genes based on a ratio of a larger of the first level of 
expression and the second level of expression to a lower of the first level of 
expression and the second level of expression. 

16. The method of claim 15, wherein genes having a ratio further from 
1 .0 arc ranked over genes having a ratio closer to 1 .0. 

1 7. The method of claim 12, wherein the step of establishing a ranking of 
the identified genes that are differentially expressed comprises: 

ranking the identified genes based on a standard deviation for the first level 
of expression and the second level of expression for identified genes. 

18, The method of claim 17, wherein genes having a higher standard 
deviation are ranked over genes having a low'er standard deviation. 

19. The method of claim 12, whcrem the step of establishing a ranking of 
the identified genes that are differentially expressed comprises: 

ranking the identified genes based on at least one of the first level of 
expression and the second level of expression for identified genes, wherein genes 
having a moderate level of expression are ranked over genes having a lower le\'el of 
expression and a higher lev el of expression. 



2(' The meiiiod of claim l'^^. therein ycnes havinii a \'cr\' low ie\ei o[' 
expression and licp.os ;:a\ in^ a \ cr;. iiiyh lev el of expression are eliminated before 
ranking. 

21. The method of claim 12, further comprising selecting ranked 
5 differential genes based on at least one criterion selected from the group consisting 

of a dirferencc between the first level of expression and the second le\el of 
expression for identified genes, a ratio of a larger of the first level of expression and 
the second level of expression to a lower of the first level of expression and the 
second le\ el of expression, and a standard deviation for the first level of expression 
10 and the second level of expression for identified genes. 

22. The method of claim 21, wherein genes having a moderate le\'el of 
expression are selected over genes having a lower level of expression and a higher 
level of expression. 

23. A database comprising a plurality of ranked differcntiallv' expressed 
15 genes identified by the method of claim 12. 

24. The database of claim 23, further comprising additional differentially 
expressed genes that are substantially relevant to the biological process. 

25. An informati\ c array of sequences on a solid support wherein said 
sequences are derived irom a plurality of differentially expressed genes identified by 

20 the method of claim 1 . 

2b. A method for selecting a plurality of genes for an informati\'e nucleic 
acid array for a biological process, comprising: 

ideniif\'ing genes that are differentially expressed in the biological process; 
establishing a ranking of the differentially expressed genes, wherein genes 
25 having a moderate level of expression are ranked over genes ha\'ing a lower le\ el of 
expression and genes ha\ ing a higher level of expression; and 

placing sequences derived from ranked differentially expressed genes on the 
informative array. 

27. The method of claim 26, wherein the genes are expressed in the 
3'J biological process. 

2S. The method of claim 26, wherein the step of identifying comprises: 
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pro\'iding a first set of nucleic acid probes derived from a first biological 

sampie; 

pro\iding a second set of nucleic acid probes derived from a second 
hiologieal sample, uherem said second biological sample is dissimilar fron: said 
first biological sampie: 

hybridizing the first set of probes to the microarray and dctcniiining a first 
level of expression for h\ bndized genes; 

h>bndizing the second set of probes to the microarray and detennining a 
second le\ e! of expression for hybridized genes; and 

identifying genes that arc differentially expressed by comparing the first 
level of expression with the second level of expression for hybridized genes. 

29. The method of claim 26, wherein the step of identifying comprises: 
quantifying the difference in the first level of expression and the second level 

of expression for hybridized genes by determining a higher hybridization intensity 
difference between the first le\ el of expression and the second level of expression, 
establishing a ratio between the first level of expression and the second le\*el of 
expression, establishing a ratio threshold for the first level of expression and the 
second level of expression, detennining a higher standard deviation for the first level 
of expression and the second le\*el of expression, or determining a higher percent 
standard de\'iation for the first level of expression and the second level of 
expression. 

30. The method of claim 26, wherein the step of identifying comprises: 
quantifying the difference in the first level of expression and the second level 

of expression for hybridized genes by applying an algorithm to the first le\-el of 
expression and the second level of expression, perfotrning cluster analysis on the 
first level of expression and the second level of expression, or performing 
dendrogram analysis on the first level of expression and the second le\'el of 
expression. 

31. The method of claim 26. further comprising: 

appKing the selected genes to a filter surface, a glass surface, a plastic 
surface, a solid bead, or a gene chip surface. 



32. The method of claim 26. w herein the step of establishing a ranking 
comprises employing at least one analytical method lo rank the differcniialh* 
expressed genes. 

33. The method of claim 20, further comprising the step of: 

using the informative nucleic acid arTa>' to perfomi gene expression analyses 
on RNA isolated Iro:" cells, cell cultures, cell lines, tissues, organs, organelles, 
biological fluids or w hole organism.s. 

34. The method of claim 20. further comprising the step of: 

using the arra>' to identify at least one signature biomarkcr gene, wherein the 
signature biomarker gene is expressed in a cell-specific maruier, a differentiation- 
specific manner, or a tissue-specific manner, 

35. The method of claim 26, further comprising the step of: 

using the informative nucleic acid array to identify at least one sentinel 
biomarker gene, wherein the sentinel biomarker gene is expressed in response to 
treatment by an agent which comprises a drug, a toxin, a chemical, a temperature, a 
pressure, or electromagnetic radiation, 

36. A method for converting a nucleic acid array into an infom"iative 
array comprising: 

providing a first set of heterogeneous nucleic acid probes derived from a first 
biological sample; 

providing a different, second set of heterogeneous nucleic acid probes 
derived from a second biological sample; 

hybridizing a nucleic acid array comprising a plurality of sequences with the 
first set of probes and detennining a first level of expression for sequences of the 
array; 

hybridizing the arra>' with said second set of probes and determining a 
second level of expression for sequences of the array; 

identifying a plurality of genes that are differentially expressed in said 
biological process by comparing the first level of expression with said second level 
of expression for hybndizcd sequences; and 
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selecting moderately expressed genes from the pliiralit\' of icleniifiec 
di fferentialpv expressed genes for inclusion on the informative array. 

37. An mtbrmaiiAe nucleic acid array for gone expression anal\'si5 
comprising: 

5 sequences deri\ ed from genes of cells that are substantially relevant to a 

biological process; 

sequences deri\ ed from a plurahty of differentially expressed genes that are 
rele\ ant to the biological process as identified by the method of claim 1; and 

a platform for the sequences which is selected from the group consisting of a 
10 filter surface, a glass surface, a plastic surface, a solid bead surface, and a gene chip 
surface. ' 

38. The nucleic acid array of claim 37, wherein at least one of the 
expressed genes and the differentially expressed genes comprise a moderateK* 
expressed gene. 

15 39. The array of claim 37, wherein the differentially expressed genes are 

derived from genes of epithelium cells or tissue. 

40. The array of claim 39, wherein the epithelium is obtained from skin, 
skin appendages, oral tissue, gastrointestinal tract or urogenital tissue. 

41 . The airay of claim 37, wherein the biological process is selected from 
20 the group consisting of the cellular processes of adsorption, distribution. 

metabolism, and excretion of drugs, toxins and chemicals. 

42. The array of claim 37, wherein the biological process affects the 
behavior of endogenous or exogenous chemicals in cells. 

43. The array of claim 37, wherein the biological process is specific to 
25 dcrmatolog\. pharmacolog)*, toxicology, pathology, oncology, gastroenterologx , 

urology, or gynecology. 

44. The array of claim 37, wherein the cells are selected from the group 
consisting of cells derived from skin, skin appendages, oral tissue, gastrointestinal 
tissue, neural tissue, renal tissue, hepatic tissue or urogenital tissue. 

30 
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Provide Biological Sampies 
Cell culture samples (e.g.. ce-i ar.es. primary culrures. co-culaires, etc.) 
Tissue samples < e.^.. biopsies, necropsies, e.x vivo, eic.) 

T rearmenLS of cell culcures and or tissue samples le.g., drugs, :oxms, electromagnetic radiation. 

etc. 1 ^ 

I 

II, Perform Nucleic Acid Array Experiments 

1. Prepare RN.\ samples trom biological samples ( irom step I) 

2. Prepare labeled cDNa probes (e.g.. radioisotope, fluorescent, etc.) 

3. Mybndize probes to available nucleic acid arrays (e.g., contaming thousands otMmmobilized 
genes) and remove unbound probe 

4. Collect hybridization intensities as raw data with a detector device 



III. Perform Data Analysis | 
! 1. Standardize raw data hybridization intensities, with regard to controls, standards, and/'or between 

^ data sets j 

; 2. Detenrmne individual gene e.xpression ratios by comparing standardized mtensities of individual : 

genes between data sets t 

■ 3. Rank or bias for moderately-expressed genes regardless of ratios (e.g., bias against very low and I 

■ very high intensity mdividual genes) ' 
I 4. Rank differentially-expressed genes based on ratios alone or with additional anaKtical methods [ 

(e.g., arbitrary ratio threshold, algorithm, graphical, cluster, dendrogram, etc.), thus identifying the ! 

i most over- or under-expressed individual genes i 

I 

( IV. Design of Informative Nucleic Acid .Arrays 

; 1. Select individual genes that are expressed in or relevant to the desired application 

; 2. Select informative differentially-expressed individual genes (from expenmenial results in step III), 

I representmg one or more groups of biomarkers: | 

i • "Signaaire" biomarkers of cell r.pes and tissue types | 

i • ''Sentinel" biomarkers of treatments | 

\ 3. Prepare replicates of some or ail of the selected genes (from #1 & 2) for statistical analysis 

! purposes I 

i 4. Immobilize nucleic acid samples of mdividual selected genes (complete or panial) on a nucleic | 

I acid arrav matnx (e.^., tllters, elass. c hips, etc.) to produce an inlbrmative nucleic acid array _J 

; ^ 

V, Use of the Inlormative Nucleic Acid Arravs 

! • Repeat steps I, II. and III using an informative nucleic acid anay (from step IV) to obtain ] 

experimental results {e.g. signature and/or sentinel biomarkers) . 

! • Optionally, nay validate gene expression results obtained in step III using one or more methods j 

j (e.g., RT-PCR, Nonhem blots. Real Time PGR, proteomic analysis, immunodetection, etc.) ! 
: • Optionally, may repeat step IV using the experimental results to produce denvative and'or refined 

' informative nucleic acid arrays 
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